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£3 ' Abstract 

Using an associated branching process as the basis of our approximation, we 
| show that typical inter-point distances in a multitype random intersection graph 

have a defective distribution, which is well described by a mixture of translated and 
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scaled Gumbel distributions, the missing mass corresponding to the event that the 
vertices are not in the same component of the graph. 

Keywords. Intersection graph, shortest path, branching process approximation, Pois- 
son approximation. 



1 Introduction 

> 

Bipartite graphs have been studied in a variety of applications: directors and companies 
[T9] . persons and questions in an intelligence test [IT], or genes and gene properties (21], to 
give just a few examples. Typically, in such applications, the graph induced on the vertices 
of one of the two parts, with vertices linked if there is a path of length 2 joining them in 
the bipartite graph, are of primary interest. For instance, the structure of the network 
linking directors may be of greater interest than the companies involved. Furthermore, 
in some applications, the remaining part of the bipartite graph, which is responsible for 
forming the links, may not be known or observable, and it may be of interest to deduce 
its existence from the properties of the observed part of the structure alone. However, 
the statistical properties of such bipartite graphs are not well understood, particularly 
when there are different types of vertices, see [19J. Here, we shall be concerned with the 
properties of a particular family of such graphs, known as random intersection graphs, 
and with the statistics of distances between randomly chosen points. 

Random intersection graphs are constructed from two sets, the 'vertices' and the 
'objects', as follows. Each vertex v € V is associated with a randomly chosen subset A v of 
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a finite set U of objects, and two vertices v and v' are joined in the graph if A v f]A v > ^ 0. In 
the simplest case, the Bernoulli model, vertex v is associated with object u independently 
of all other associations with fixed probability p. Britton et al. [6] establish a branching 
process approximation for the spread of a Reed-Frost epidemic on such a graph. Here, we 
consider the more flexible model in which there are K distinct types of vertices and J types 
of objects, and vertex v of type k is associated with object u of type j independently of all 
other associations with probability pkj. Our model can be viewed as a bipartite Erdos- 
Renyi mixture graph [Sj. In Erdos-Renyi mixture graphs, vertices are coloured, with the 
probability of two vertices being connected depending only on their colours; edges occur 
independently 

Random intersection graphs of this kind can also be related to the Rasch [T7] models 
in social science. These are given by taking 



For example, one might have k ~ j if person k solves problem j correctly; the a's would 
then relate to the ability of the person, and the /3's to the type of problem. A simplified 
Rasch model of the form p^ = akPj can be viewed as a special case of an exponential 
random graph model, see Equation (1) in [20j. 




In the study of random networks, the shortest distance between two randomly cho- 
sen vertices is one of the standard summary statistics. In this paper, we approximate 
its distribution for multitype Bernoulli intersection graphs. Since the networks used in 
applications are typically finite, we not only provide a limiting approximation, but also 
give explicit bounds on the difference between the true and limiting distributions. Our 
main results, summarized in Corollary 16.41 give an approximation described in terms 
of the distributions of the limiting random variables W of the associated multivariate 
bipartite branching process, when the process starts with a single individual of one or 
other of the types. The probability of the two vertices being in the same component is 
well approximated by the product of the probabilities that neither of the branching pro- 
cesses becomes extinct. On this event, the distance has a distribution close to that of a 
translation-mixture of scaled Gumbel distributions, with the mixture distribution being 
explicitly given in terms of those of the limiting random variables W. Alternatively, the 
approximate distribution can be described as that of (a linear transformation of) the sum 
of three independent random variables, one a Gumbel, and the others distributed as the 
logarithm of W, given the appropriate initial types. In a natural asymptotic framework, 
the error bounds behave like an inverse power of the total number of vertices, whose ex- 
ponent can be derived from the parameters of the bipartite graph: the probabilities p^, 
and the numbers nk of vertices, 1 < k < K, and rrij of objects, 1 < j < J, of the different 
types. 

The structure of the paper is as follows. The link between intersection graphs and 
branching processes is described in Section [2J The necessary distributional properties 
of the branching process are established in Section [31 and the extent to which it differs 
from the intersection graph is controlled in Section HJ The main theorem is then stated 
and proved in Section [6j and an application to exponential random graph models is 
given in Section [7J A key element in the proof is a Poisson approximation to coincidence 
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probabilities in a generalization of the hypergeometric sampling scheme; this is undertaken 
in Section [5j 

Although our motivation for studying this problem comes from the bipartite setting, 
one could equally well conduct a similar analysis for a graph without bipartite structure, 
recovering the general Erdos-Renyi mixture model; a corresponding approximation is 
given without detailed proof in (I6.3ip . However, the analysis for a 'general' graph would 
not easily imply our results as a special case, with the vertices split into two groups and 
with a bipartite matrix of edge probabilities P, because the 2-periodic structure would 
result in there being more than one eigenvalue of the mean matrix having largest modulus, 
and methods such as those of this paper would still be needed, to deal with the extra 
complexity that results. 



2 Intersection graphs and branching processes 

A random multitype intersection graph on the vertex set V = V\ U . . . U Vk is defined 
using a second set of 'objects' U — U\ U . . . U Uf. each vertex v E V k independently 
chooses a subset A v G U with distribution depending on k alone, and v ~ v' if and only 
if A v fl A v > 7^ 0. Here, we restrict ourselves to graphs derived from an underlying Erdos- 
Renyi bipartite mixture model, in which only edges e uv between u G U and v G V are 
possible, and these are present or absent independently, with probability p k j if u G Uj 
and v E Vk. Thus, in the random intersection graph itself, v ~ v' if and only if, for some 
u G U, both e uv and e uv > are present. 

Such a random graph can be constructed from a bipartite multitype branching process 
(Z(0),Z(1),Z(2),...) = (X(0),y(l),X(l),...), with X(i) G and Y(i) G N J for 
each i, together with sets of randomly assigned indices. Start with numbers X(0) = 
pTi(O), . . .,X K (0)) of individuals of the different types {(Jfc, 1), 1 < k < K}. The s-th 
individual (in some ordering) of type (k, 1) in Z(2r) = X(r) has offspring vector Y ks . r = 

{Y k ^i., ■ ■ ■ , Y k ^l), realized from the product distribution ®^ =1 Bi (nij,Pkj), where rrij is the 
cardinality of the set Uj, and the random vectors (Yfc s;r , 1 < k < K, s > 1, r > 0) are 
independent. We then set 

Z(2r + 1) = Y(r + l) = Y,Y1 ( 2 -l) 

fc=l s=l 

Similarly, the t-th individual of type (j, 2) in Z(2r — 1) = Y(r) has offspring vector 
Xjt-r = (Xj]^ r , . . . , Xj^), realized from the product distribution ®k =1 Bi (rik,Pkj), where 
rik is the cardinality of the set Vk, and the random vectors {Xj t - r , 1 < i < </, s > 1, r > 1) 
are independent of each other and of the Yk s - r - We then set 

Z(2r) = X(r) = J^/Z 



We also define 
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^mj! n := ^n fc . (2.2) 

j=l k=l 
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Throughout this paper we assume that rrij > 2 for 1 < j < J, and > 2 for 1 < k < K, 
and that X k (Q) < n k for 1 < k < K. 

To obtain the intersection graph, label each individual in the bipartite branching 
process with its line of descent. Thus 

{i; (k , s ), (ji, ti), (k u si), . . . , (j h U), (k h Si)} 

labels the s^-th individual of type (hi, 1) in generation 2i, which was descended from 
the tj-th individual of type (ji, 2) in generation 2i — 1, and so on. These labels are then 
augmented with indices from the index set appropriate to the type of individual, as follows. 
The Y^.. { type (j, 2) offspring of the typical vertex above are each assigned at random a 

unique index from a uniformly and independently chosen subset C^ S . H C {1,2, ... ,mj} 

of size 5^ a -i! a s i mnar construction is used for the offspring of objects. 

A further class identifier, or 1, is then attached to each individual: 1 if the individual 
is in generation zero, and thereafter, taking the individuals of the bipartite process in order 
of generation, but in any order within each generation, assign class if its parent was in 
class 0, or if its index had previously been assigned to another individual of the same type 
and of class 1, and 1 otherwise. The class individuals we refer to as ghosts. Edges are also 
created between the indices of a parent and its child if both are of class 1, or if the parent 
is of class 1 and the child of class 0', where class 0' indicates a class individual whose 
index was first assigned (therefore to a class 1 individual) in its own generation. Then the 
individuals that belong to class 1 correspond, via their indices, to the vertices and objects 
used in constructing the intersection graph, and two vertices have an edge between them 
if there are corresponding class 1 or class 0' individuals in the bipartite branching process 
that are at distance 2 from one another. In this way, the union of the components of 
the random intersection graph that contain the initial vertices is sequentially constructed 
according to distance from the initial vertices, the class 1 vertices in generation 2i of the 
bipartite branching process corresponding to the vertices that are at distance i in the 
random intersection graph from the initially chosen set of vertices. If these components 
do not exhaust all vertices, the process can be continued from any unused vertex, until 
all have been used. 

Two vertices, A of type ki and B of type k 2 , are at distance at least d + 1 from one 
another in the random intersection graph if the ^-neighbourhood of one of them in the 
bipartite graph does not intersect the ^-neighbourhood of the other. Constructing the 
random intersection graph from a bipartite branching process starting with one individ- 
ual A of type 1) and one, B, of type (k 2 , 1), this is the case exactly on the event that 
the set of all class 1 or class 0' descendants of A up to time d — both of types {k, 1) 
and of types (j, 2) — is disjoint from that of B. From the construction, these sets can 
only overlap if there is at least one class 1 descendant of either A or B having the same 
index as a class 0' descendant of the other, and then necessarily in the same generation of 
the bipartite process. Our main theorem consists of showing that the probability of this 
event can be well approximated by the probability of the corresponding event when all 
descendants are considered, and that this probability in turn can be well approximated 
using the theory of branching processes. 

The origin of the approximation lies in the following well known facts ([10] . II Theo- 
rem 9.2): that, on the event of non-extinction, a square integrable super-critical multitype 
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branching process, whose mean matrix is irreducible and aperiodic, has an asymptotically 
stable type distribution; and that the total number of individuals alive in each generation 
grows like a random multiple of a geometrically growing sequence. For the X branching 
process, this means that the number of individuals of type k in generation i is approxi- 
mately given by r l Wfik, where r is the largest eigenvalue of the mean matrix, /i T is the 
associated positive left eigenvector, and W is a non-negative random variable, positive on 
the event of non-extinction, and the same for all i and k. Hence the numbers of descen- 
dants X A {i) of A at the z-th generation of the X branching process are approximately 
given by T l W A ^k, and those of B by r l W B fik, where W A and W B are independent. When 
constructing the random intersection graph from the branching process, indices are as- 
signed to the vertices independently at random, with replacement. Links between the A 
and B neighbourhoods occur whenever, for some i > 1 and some 1 < k < K, one or more 
of the X B {i) are assigned the same index as one of the X A (i); other coincidences give rise 
to 'ghosts', and play no part in the intersection graph. The mean number of such events 
up to and including generation i is thus approximately 

i K 2 

J2r 2s W A W B J2— ~ K X n~ l T 2i W A W B , 



-i 



where 



K 



2 \ K 2 



i S 



T 2 _ 1 / „X ' 



and q£ := n^/n. A similar formula hold for links occurring because of coincidence of 
indices at the object level; here, the expected number of links up to and including gen- 
eration z — 1 is approximately k y n~ 1 r 2 ^~ 1 ^W A W B , where, because of (13. 3p . k y = tk x . 
Adding the two means gives an overall mean number of links approximately equal to 
Kn~ l T 2t W A W B , where 

Then, using Poisson approximation, it follows that the probability of there being no shared 
vertices in the i-neighbourhoods of A and B is approximately 

F I „-/ t n- 1 r 2 W A iy fl 

Mil, 



kite 1 C f > 



this being the probability that the distance between A and B in the intersection graph 
exceeds 2i. This line of reasoning is made precise in the coming sections, and the detailed 
results are to be found in Theorem I6.2I and Corollary | 



3 Counting the offspring 

We now study the bipartite branching process Z in greater detail. Our aim in this section 
is to justify the simple approximation, outlined above, to the numbers X k {i) of type-(/c, 1) 
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individuals in Z(2i) (or, equivalently, of type-/c vertices in the i-th generation of the vertex 
branching process) and Yj(i) of type-(j, 2) individuals in Z(2i — 1), (or of objects in the 
i-th generation of the object branching process). Theorems 13.61 and 13.71 below show that, 
for large i, X(i) ~ t 1 W \i and Y(i) ~ T % ~ x (,Wp,, the notation being as defined below. 



3.1 Assumptions and notation 

Let 

Nx = diag (ni, . . . , n#) and N Y = diag(rai, . . . ,mj) 

be the diagonal matrices of the numbers of different types of vertices, and of different 
types of objects, respectively; for future convenience, recalling f)2.2p . we define 



<£:=—, l<k<K; qj:=^, 1 < j < J. (3.1) 

n J m 

Let P denote the K x J-matrix of edge probabilities, and put 



M x = PN Y P T N X . 

Then the non-negative matrix Mx is the mean matrix of the X branching process. 

Assumption. We assume that the non-negative matrix Mx is irreducible and aperiodic, 
and has largest eigenvalue r > 1. 

We use v and fi T to denote respectively the right and left eigenvectors corresponding 
to r, with n k > 0, 1 < k < K, standardized so that = 1 and that /i T z/ = 1. We 
assume throughout that r > 1. We then define My '■— P T NxPNy to be the mean matrix 
of the Y branching process, and 

ft ■= C^NyP 7 ^ (3.2) 

to be the left eigenvector of My with ||/i||i = 1 corresponding to the eigenvalue r. Thus 
( := /i T PA^yl, where 1 denotes a J x 1-vector of l's. 
Note also that 

C 2yf^ = ( NyP T ^t n -x {NyP t } = ^ PNyP t 



777,1 



K 2 

lx T MxN^H = r/i T iV-V = tJ2—> 

k=i Uk 



so that 



say. 




CVm = r {T^ / {y,^ = 3 2 , (3.3) 
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We next define Cq to be the smallest value such that 

sup sup r _i a T M^e (fc) < c /i k , 1 < k < K; 

a: ||a||i=l i>0 

sup sup r^a T M^g {i) < co/%, 1 < j < J, (3.4) 

a: ||a||i=l «>0 

where e^ k ' and eP' denote the k and j unit vectors in and R J respectively. Further, 
with A2 the eigenvalue of Mx with second largest modulus, we define c\ such that 

sup sup |A 2 |~"1| (Mx - Tz//i T )''6||oo < ci, (3.5) 

b: ||6||oo=l i>0 

where we take (Mx — TUfi T )° := I — z//i T . Note that it follows from the Perron- Frobenius 
Theorem that Co and C\ are both finite; see [11J, Theorem 8.5.1, and \H>\, Chapter 1, 
Theorem 6.1. We also, for later use, write 

7 := max{r, |A 2 | 2 } < r 2 ; 9 := max(s + 1)(J^/t) s , (3.6) 

and introduce the notation T r to denote the cx-algebra cr{Z(t), < t < r}, and := 
a{X(l), < Z < i}, := a{Y(l), l<l<i}. 

3.2 Asymptotics 

The main results of the paper require no particular asymptotic setting. However, asymp- 
totics are useful for putting the results in the context of a natural limiting framework. 
One such choice is the following. Start by choosing the and rrij so that the proportions 
q£(m, n) and qj(m, n) converge to non-zero limits. Then one can arrange for P = p( m>n ) 

to vary as m and n tend to infinity, in such a way that the matrix M^'^ converges to a 
fixed irreducible and aperiodic matrix Mx, entailing the convergence of quantities such as 
T (m,n) ^ jj(m,n) an( ^ ^(m.n) ^ Q }j m jt s ^ v anc i r . With this in mind, define : = n~ 1 Nx 

and Q^y 1 '^ := m _1 iVV, and take p( m > n ) := (rrm) _1 / 2 II, for a fixed matrix II. This then 
gives il4 m ' n) = UQ^' n) U T Q { ^ n \ so that, if Q<g> n) -> Q x and Q^' n) Q Y , with Q x 
and Qy both having positive diagonals, then M^" 1 ' 11 ' — > Mx '■= HQy^- T Qx- If also, in 
keeping with the general assumptions of the paper, we have r > 1, then we describe this 
behaviour as 'standard asymptotics'. 

Other asymptotic settings could equally well be considered. For instance, there would 
be no great difference in the qualitative behaviour if r^ m ' n ^ were allowed to tend to infinity 
with n like a power of log n. 

3.3 Expectations 

We begin our analysis by examining the growth of the mean numbers of individuals of 
different types. Using E to denote E{- | J^}, we immediately have 

E{X T (t)\ T 2i - X } = Y T (i)P T N x ; B X T (t) = X T (0)M l Xl (3.7) 
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and 

V{Y T (t)\ F 2i _ 2 } = X T (i-l)PN Y ; E Y T (t) = X T (0)M^ 1 PN Y . (3.8) 
From these, and using (13. 4ft . we have, for instance, 

E{X k (^\^} = J2Y T (s)My- s ^(e^) T P T Nxe {k) 

3=1 

< c \\Y( S )\\ 1 r i ~ s jl T P T N x e^ = coWYis^r^C'^Mxe^ 
= coWYis^-'iT/Ofik, 

so that, for any 1 < k < K and 1 < j • < J, and for % > s > 0, 

E{X k (i)\F?} < coWXis)^'^,,; E{X k (i)\Fj} < co\\Y(s)\\ 1 C 1 T i -' +1 fjL k ] (3.9) 
V{Y^)\Tj} < CoWYis)]]^-*^; EfY^OlT^J < c \\X(s - 1)11^^(3.10) 
E X k (i) < coU^UiT-V*; EolS-(i) < collXWIKCr'- 1 ^, (3.11) 

3.4 X-Covariances 

Controlling the covariances of the components of X(i) and Y(i) requires more work. To 
start with, we observe that 

E(X T (i + l)u\J r l x ) = X T (i)Mv = TX T (i)v, 

so that Wi := r~ l X T (i)v, i > 0, is a non-negative martingale with respect to the filtra- 
tion {J 7 * , i = 0,1, . . .} which converges almost surely to a limit W, and 

E (X T (*)z/) = r i X T (0)z/. (3.12) 

The variability in the branching process is essentially determined by that of W, which 
is itself largely determined during the early stages of development. Indeed, writing 
X T (i) = X T (i){I — v[i T } + X T (i)i>n T , we show that the variance of X T (i)b is domi- 
nated by (/x T fe) 2 Var (X T (i)v), unless fi T b = 0. 

Lemma 3.1 The variance of the martingale Wi is bounded as follows: 

(i) Wax (Wi-W) < c 2 {t/(^ - l)}||^s:(0)|| 1 'r- i ; 
(ii) Vai Wi < c 2 {t/(t- 1)}||JT(0) || l3 

where c 2 := c r -2 ||z/||^ S'(S) and S(E) := maxi<k<x $^fm=i l^lml- ^ s defined below. 
PROOF: We begin by writing 

k x k (i) 

k=l r=l 
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where X^-i denotes the X-vector of descendants in X-generation i + 1 of individual r 
of type k in X-generation i. The random vectors (Xk r -,i, 1 < k < K, r > 1, i > 0) 
are independent, and, for each k, the X& r .j are identically distributed, with means the 
transpose Mxm of the fc-th row of M x , and with a covariance matrix that we denote 
by EW. Then' 



X * fc (i) 



fc=l r=l 



and so 



fc=l r=l 



X T (t + l)-X T (i)M x = J2J2( X kr;i~ M 
Considering the right hand side, we have 



T i 
X,[k]j 



(3.13) 



fc=l r=l 



yk=l r=l J \k'=l r'=l 

K X k (i) 



^ 2_j E{(X fcr; j — M x ,[k]){Xkr;i — M X ,[k]) | Fi } 

k=l r=l 
K 



(3.14) 



k=l 



by the independence of the vectors and their having mean zero. Hence, using f)3.14p . it 
follows that 



E{(r-^X T (z + l)u-r- t X T (t)u) 2 \ F t } = r~ 2 ^ ^X k (i)v T Zl%, (3.15) 
and thus, from (13. lip . 



k=l 



E {(r(' +1 )l T (i + l),-r-I T ( ! ),) 2 } < f^CollXWIKr— 2 ^ T S 

k=l 



[k] 



V 



< cb||X(0)||iT-'-1i/||^(E) = c.WXm.T 



(3.16) 



where 5"(S) := maxx< fc <^ Y^f m =i l^lml an< ^ c 2 := c o r 2 || Z/ IIL'5'(^)- Writing the martingale 
t~ % X t [i)v as a sum of its one-step differences, the lemma now follows easily. □ 

The next lemma controls the variances of those components of Xii) that are orthogonal 
to v\ note that 

X T (i) = (X T (i)v)» T + X T (i)(I-v» T ). 
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Lemma 3.2 With c 2 as in Lemma \3.1\ and 7 := max{r, IA2I 2 } < t 2 , and with C3 := 
cfS'(S), we have 

(i) Var {X T (z)(/-^ T )6} < c 3 ||X(0)|| 1 if 

(ii) Var {X r (z)6} < 2||X(0) || x {c 2 {r/(r - 1)} r 2i (/7&) 2 + c 3 i-y* I|6||2o> 
for any b G . In particular, with b = e k i/ie A; 4/i unit vector it follows that 

(Hi) Var {X k (i)-(X T (i)u)fi k } < csWXWWtij', l<k<K. 
PROOF: Recalling that /i T z/ = 1, it can be seen by induction that 

(M x -r(ufi T )) 1 = M l x -r\ufi T ). (3.17) 
Hence, with M x = I and as un T (Mx — tvij?) = 0, 

X T (i){I-vf) 

= E ( x2 > + !) r ^ - (^v 4 -*- 1 '] - [^r - } 

+ X T (0) [M x -(^ T y] 

= E { X "> + X )( J " ^) - ^ T W(Mx - r(V))} [M X - r(^)]— 1 

r=0 

+X T (0) [Mi - ( W] 

i-l 

= £ ^(r)A— 1 + X T (0) [M x - V] , (3.18) 

r=0 

where 

f/ T (r) := (X T (r + l) - X T (r)M x )(I - u/j, T ) and A := M x -t(v{i t ). 
Note, in particular, that E {U \r)U T (s)} = whenever r 7^ s, in view of (13.131) . and that 

E{f/(r)[/ T (r)|j; x } = ^X fc (r)(/-^)SW(/-z//i T ). (3.19) 

fe=i 

Hence, since (I — b>fi T )(Mx — tvh t ) = Mx — TvyJ ', it follows from (13. 5p and (13.111) that 

i-l 

Var {X T (z)(/-z/ / u r )6} = 6 T J](A r ) i - 1 - r Eo{C/(r)C/ T (r)}A i - 1 - r '6 

r=0 

i-l X 

< ^^E X fc (r){c 1 |A 2 |( l - 1 ^||6|| oo } 2 S(E) 

r=0 fc=l 

i-l 

< c ||X(0)|| 1 ^r r {c 1 |A 2 |(- 1 ^||6|| oo } 2 5(S), (3.20) 

r=0 

10 



for any b G R^, proving part (i), and part (iii) follows directly. Since also, from 
Lemma l3.1l (ii). 

Var {X T (*)^ T 6} < c 2 {r/(r - l)}!^)^ 2 ^) 2 , (3.21) 

it follows from part (i) that 

Var {X T (f)6} < 2{Var {X T (2)z//i T 6} + Var {X T (i)(/-^ T )6}} 

< 2{c 2 {r/(r-l)}||X(0)|| 1 r 2i ( / i^) 2 + C3 ||X(0)|| 1 ^||fe||L}, (3-22) 
establishing part (ii). □ 

Note that the growth of Var o{X T (i)b} with i is at rate 0(iy H&H^), slower than r 2t , if 
fi T b = 0. Note also that, if r ^ | A2 1 2 , the factor i can be replaced by a constant c(r, | A2 1 2 ) - 

Corollary 3.3 For all 1 < k,l < K , there are constants c§, c% such that 
E {X fe (z)X,(z)} < c 5 ||X(0)|| 2 //^T 2i + C6 ||X(0)|| 1 zTY /2 - 
In particular, for c' 5 := C5 + cqK 2 9, where 9 is as in $3. 6]) . we have 

E ||X(*)|| 2 < 4r 2 lX(0)|| 2 . 

PROOF: It follows from (13. lip and Lemma I3.2l (ii) with the Cauchy-Schwarz inequality 
that 

E {Xft(i)*i(i)} < |E X fc (i)EoX,(i)| + \Cav (X k (i), 
< {c 2 ||X(0)|| 2 + 2c 2 ||X(0)|| 1 t/(t - l)}/x fc ^r 2i 

+ 4||X(0)|| 1 (^) 1/2 rVc2C 3 {r/(r - 1)} + 2 C3 ||X(0)|| 1 z 7 \ (3.23) 

and the corollary follows by taking c 5 := c\ + 2c 2 r/ (r — 1) and c 6 := 4a/c 2 c 3 {t/(t — 1)} + 
2c 3 . □ 



3.5 Y-Covariances 

Very similar arguments can also be carried through for the vectors Y(i), i > 1. We 
first show that Yj(i) is close enough to X T (i — l)PNye^\ for any 1 < j < J. Let 
Zj{i) := ^-(i) - X T (z - l)PN Y e ij \ 

Lemma 3.4 There is a constant C10 suc/j i/ioi, /or 1 < j < J, 

Varo^^i)} < CiollX^HxT^-. (3.24) 
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PROOF: First, the quantity Z T (i) := Y T (i) — X T (i — l)PNy is represented in fashion 
analogous to A3. 13[) . For independent random vectors Yk s , r as in (12.11) . 

K X fc (i-1) 

Z T ti) = Y T {i)-X T {i-l)PN Y = Y, J2 (V1-EV1), 

k=l s=l 

from which it follows that 

K 

E{Z(i)Z r (i)|^ 1 } = IJeM 

fc=i 

where := diag {rrijPkj(l —Pkj), 1 < j < and hence that 

K 

E {Z(*)Z T (*)} = ^E X fe (i-l)EW. 

fe=i 

This with (13.111) and (13. 2p in turn yields 

K 

Varo^i)} = ^E X fe (z-l) e ^) T sWgW 
fc=i 

< CollX^H^-^/ifem^^l-^) < dollX^UxCr^, (3.25) 
fc=i 

with cio := cq/t. □ 

We now, for future use, define the quantity 

C* := J max II PN Y e {j) II (3.26) 
i<j<j" 

noting also that 

j 

(rUinftjC < C = ^/x T PiV y gW < [ImIIiC- = C*. (3-27) 



Hence, in view of (13. 3p . we write 



:— Z*W — , (3.28) 



noting that 3 and can be thought of as having comparable magnitude. We also 
introduce the notation 

: = (n) }' < 3 29) 

Using Lemma T3.4[ we can now establish an analogue of Corollary 13.31 for the elements 
of Y{i). 
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Corollary 3.5 For all 1 < j,l < J, there is a constant c 6 such that 

E {Y j (i)Y l (i)} < c,e\\xm\^mr 2(i - i] + c^jxm^- 1 ^ 2 , 

where C5 is as in Corollary VJ.'d In particular, for c'l := 3 2 Cs + c' & J 2 9, we have 

EolinOII? < ^L^^ ||X(0) 11?. 
Proof: We first note that 



< |E ^(i)E y z (i)| + ^Vax {l}(i)}Vaxo{H(i)}. 

Now, writing Y^-(i) = X T (i — l)PNye^ + Zj(i), it follows from Lemmas I3.2l (ii) and 13.41 
and from (13. 26j) that 



Var {^(i)} 

< VnxWi { V / c 2 r/(r - ly-'C^ + V^-^T-'J^C* + V 010^/2} 
V 2 ll^(0)l|i { - l)^ -1 ^ + ^mnVc^T 1 " 1 



< 



where -y/cj = (J 1 y / C3-Z* V a/ cio3t/2). A similar bound holds also for A/Var {V;(«)}. 
The corollary now follows from ( 13. lip , and by taking c' 6 = 43\/c2C4t/ (r — 1) + 2C4. □ 



3.6 X- and y-approximation 

Using the preparation above, we are now able to approximate Xk(i) and Yj(i), i > 1, in 
terms of the limiting random variable W, making precise the description at the end of 
Section [2j and bounding the error in the approximation. We begin by considering the 
X-components. 

Theorem 3.6 There is a constant eg such that, for 1 < k < K , 

Er 



X k (i)-rW/i k < c 9 ||^(0)|| 1 ((i + l)^^ 1 / 2 , z>0. (3.30) 
Proof: Here, we note from Lemma O(iii), fl3J|, ([32D and ( TBTTj) that 

E |x fc (z)-(X T (z)z/K 

< (Var {X fc (z) - (X T (i)v)fi k }) 1/2 + \E {X k (i) - (X T (i)v)fi k }\ 

< (c 3 \\X(0) || ^) 1/2 + \\X(0) || x || (Mi - rV/)e« |U 

< c 7 \\X(0)U(z + l)Y) 1/2 , 
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with c 7 := y/ci + c±, whereas, from Lemma l5TTI (i) . and the Cauchy-Schwarz inequality, 



E r 



(X T (i)u-r i W) t i k < ^ h T i {c 2 {T/{T-l)}\\X{0)\\ 1 T- i }^ 2 < csWXm^ 2 , (3.31) 



with c 8 := {c2t/(t — l)} 1 ^ 2 . Hence the theorem follows, with eg := Cj + eg. 



□ 



With the help of Lemma [3.41 we can also approximate Y(i) in terms of the limiting 
random variable W, complementing Theorem 13.61 



Theorem 3.7 There is a constant cu such that, for each 1 < j < J, 

E |^(z) - r^CW^I < c u u mn \\X(0)\\ 1 (i?) 1 fi l , i > 1, 
where jx is given in A3.2\) . 
Proof: It is immediate from Lemma [3.41 that 



(3.32) 



Er 



Y j (i)-X T (i-l)PN Y e^ < {cioCll^WHillAlloor*} 1 / 2 < chC 1/2 ||X(0)||it^ 2 , 



with Cn := \J Cio||/i||oo, and then, as in the proof of Theorem I3.6j using Lemma l3~TI (i) 
fl3TT7j) and (j33|) . we have 



E 



X T (i - l)PN Y e {j) - (X T (i - l^fij 

< {(csll-XTCO)!!!^ — l)^ 1 ) 1 / 5 " -h (0)|| lT C*- 1 >/ a }|| J PJWeC^|| e 

< c 12 C*||*(0)||i(z7 l ') 1/2 , 



with C12 := J 1 ^/l/7( v / C3 + C\). Then, once again invoking Lemma l3TTI (i). as for (I3.3ip . 
we have 



E 



(X T (i-l)vji j -T i - 1 tWjM j < cgCAill^WHir^- 1 ^ 2 < c 13 Cll^(0)||iT V2 , 



with C13 := (cg/v^ 1 ) ||A||oo- The theorem follows, since 7 > r, by taking c u := ({cha/3} V 
{c 12 Z, + c 13 3}). □ 



4 Ghosts 

In the previous section, we justified simple approximations to the joint counts X(i) 
and Y(i) in the bipartite branching process. We now need to show that the same approx- 
imation can be used for the composition of the neighbourhoods in the intersection graph, 
albeit with a further error. This involves showing that the effect of the 'ghosts' is not 
too large. Let G^{i) and Gj(i) denote the total numbers of type (k, 1) and of type (J, 2) 
individuals of class (ghosts), respectively, alive in generations 2i and 2i — 1 respectively 
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of the bipartite process that starts with individuals A and B. Then it turns out to be 
enough to derive bounds for their expectations, as functions of i. 
To state the result, define 



p x := max U,ulq X \ p Y := max Ui/qJ, (4.1) 
i<k<K ' k i<i<J 3 

where q x and q Y are as in (13. ip . Note that, for the Z-process under consideration, 
ll^(0)||i = 2. 

Theorem 4.1 There are constants cj 5 and c* 6 such that 

E G x (i) < c* 15 T 2i e(m,n)\ 1 < k < K; 
EoGjii) < C * w ^r 2 ^e(m,n)\ 1 < j < J, 

where e(m, n) := n -1 / 4 + m -1 / 4 . 

Proof: The ghosts can be counted by descent from original ghosts, whose parents were 
of class 1. We write H x (l) and Hj{l) to denote the numbers of original ghosts of the 
corresponding types born in the /-th generations of the X and Y processes. For i > I > 0, 
we then let G XY (l, i) and G xx (l, i) denote the total numbers of descendants of generation / 
original Y and X ghosts, respectively, alive at time 2i in the bipartite process, that are 
(k, 1) individuals; the quantities Gj Y (l,i) and G YX (l,i) are defined analogously. Thus 



G X (i) = H x {i) + Y,{ G k Y ^ + ^) + G xx {l^)} ] (4.2) 

1=0 

i-l 

Gj(i) = H Y (i) + J2{Gj Y (l,i) + G YX (l,i)} + G YX (0,i). (4.3) 
i=i 

Note that, for % > I > 0, from (jH2J) and (l3~3l) . 

E(G XY (l,i)\H Y (l)) < 003^11^(0111^)/^; (4.4) 

V m 

and that, for i > I > 0, 

E(G^(/,OI^ X (0) < Coll^WHir^W; (4-5) 

E(Gf y (/,0|F y (/)) < CoU^lliT^-; (4.6) 

V{G YX {l,i)\H x {l)) < c Q -i\\H X {l)hr 1 - 1 - 1 ^^ (4.7) 

from (E3D and (l3~T0|l . 

Now an original ghost of type (j, 2) is created when an index from the set {1, 2, ... , rrij} 
is re-used. Hence 



E{H Y (l)\T Y } < J5(l){VE y j(4 
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Furthermore, from ( I3.10p . for / > s > 0, 
Combining these bounds, it follows that 

i 

VoHf(l) < cbm7 1 /2 i ^T J -*E (^-(s)||y( S )|| 1 ), (4.8) 

s=l 

and hence, using Corollary 13.51 that 

i 

Eo||# y (/)||i < c m- 1 p Y J2r l - s Eo\\Y(s)\\ 2 1 < c.rm- 1 p Y u 2 mn r^- 1 \ (4.9) 

8=1 

where cn := 4c^'cot/(t — 1). Similar calculations show that 

i 

E H x (l) < co^ 1 / i fc ^^E (X fe ( S )||X( S )|| 1 ), (4.10) 

s=0 

and, with Corollary 13.31 

E ||# X (/)||i < c 18 n- 1 p x r 2 \ (4.11) 

with Ci8 := 4cgCor/(r — 1). For future reference, we note also that, in consequence, for 
any s > 0, 

$V<E ||if^)||i < ^ m -74^; 

Z — / T — 1 

1 = 1 

y>*-*Eo||F*(0||i < ^-n-V X r 2s . (4.12) 

r — 1 



It now remains to take expectations in f ]4.2[) and ( 14. 3ft . using ( 14.4j) ( 14/71) and ( 14. 12[) . 
For instance, for E, G Y (i), we have 

i-1 i-1 

«=i i=i 
with Ci 9 := C0C17/ (r — 1); similar calculations yield 

^EoG^(M) < c 20 n^p x p 3 T^J™ : 
1=0 ' n 

with c 20 := 3c Ci 8 r/(r - 1), and 

E H Y (i) < c 17 m~ 1 p Y u 2 m y^ 1 \ 
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from (I4.9p . Hence it follows that 

with Ci6 := C20 1 1 A* 1 1 oo and C\q := cigH/iHoo + cyj. A similar argument yields the bound 
with 

C15 := ci 8 |c (r - H/illoo + l|; C15 := c Ci 7 3 _1 (r - ||HU- 
Recalling the definition f!3.28j) of Z*, this completes the theorem, with 
c* 15 := 2(ci 5 p x Vci 5 p y ); ct 6 := 2(c 16 p x V Ci 6 p y ). 

□ 



5 The probability of a common label 

Our next step is to establish a Poisson approximation for the probability of a coincidence 
in a random labelling problem. The underlying idea is to look at neighbourhoods of radii 
%a and is of two initial vertices A and B; if they have no vertices in common, then the 
distance between A and B exceeds ia + is- Whether two vertices in the neighbourhoods 
are the same can be thought of as a labelling problem, where the assignment of labels is 
almost uniform and at random. The result that we need is the following variant on the 
Poisson approximation to hypergeometric sampling. 

Proposition 5.1 For each I, 1 < I < L, and for each r, 1 < r < Ri, we independently 
draw a subset {Ji r ±, . . . , Jirz w } of size z\ r from a fixed set W\ of size w\ > 2, with any 
subset equally likely to be drawn; define z\ := Yl^Li z ir- We then repeat the experiment 
independently, with subsets { J' lrl , . . . , J! , } of sizes z' lr , 1 < I < L, 1 < r < R[. For each 

Ir 

w EWi and for each I, r, s, r', s' , define 

H(w, I; r, s; r', s') := I[Ji rs = J' lr , s , = w}. 
Then, for fixed subsets C Wi of sizes ivf , 1 < I < L, define 

L Ri R'i z w z [ r i 

S := S(z,z';w,w*) := Yl Yl Yl H ( w ' i; r ' s; r '' s "> 

1=1 1U0V* r=l r'=l s=l s'=l 

to be the number of pairs of elements, one from each sample, that consist of two copies of 
the same element, not belonging to any of the W* . Then 

\P[S(z, z'] w, w*) = 0] — exp(— \(z, z', w, L))\ < B 1 (z, z', w, L) + B{(z, z' , w, w*, L), 
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where 



X(z,z',w,L) := Y.—^ B 1 (z,z',w,L) := 2 £ ^±4 (5-1) 

Wl Wi 

1=1 1=1 

and 

Bl(z,z',w,w\L) := (5.2) 



= 1 



PROOF: The indicators /; r, s; r', s') are negatively related (for the definition see [I] 
Definition 2.1.1), as can be seen by constructing an explicit coupling very much as in jl], 
p.112. Setting 

"lr lr' 

H(l,r,r') := ^ ^ £ /; r, s; r', s'), 



tw* s=l s'=l 



the random variables H(l,r,r') are pairwise independent, and satisfy 
EH(l, r, r') = (l- W *^ ^ ■ 



Wl J Wi 

f\ ( z lr "I" z l r ' 1 



< EH{l,r,r') - Var H{l,r,r') < EH(l,r,r' 
l R t R[ 

1=1 r=l r'=l 

Hence, since w i > 2 for all I, it follows that 

Zl r + Zi r i 



Wi — 1 



so that 



< ES-VarS < 2 max lr \ ES, 

l,r,r' \ W[ 



■tr Q L Ri L R[ ! 

v^b 2 rf* +2 rrv < Bl (z,z>,w,L). 



From this, and since 

L Ri R[ 

EEE Ei ^' r ' r ') = Hz,z',w,L)-B* 1 (z,z',w,w*,L), 

1=1 r =l r'=l 

it follows using jl] Theorem 2.C.2 that 

iPfS^z, z';w, w*) = 0] — exp{ — [X(z, z',w, L) — Bl(z,z',w,w*, L)]}\ < Bi(z, z' ,w,L). 

(5.3) 
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The final estimate follows because 



|e" A -e~ A '| < min{l, |A - A'|} (5.4) 
when A, A' > 0. □ 

Note that, if, for some r > 1, none of Z\ and z[ exceeds C\T % I 2 and all of the wi exceed 
C 2 t\ then 

Bi{z,z,w,L) < 



geometrically small with i. 

Now suppose that we do not know the true values z\ and z[, but only approximations 
zi and z\ to them. Then we can instead use these to approximate P[S(z, z' ,w,w*) = 0], 
with some possible extra error. 

Proposition 5.2 Suppose that 

\zi — zA = E\ and \z[ — z[\ = e',, 1 < I < L. 



Then 



\P[S(z,z',w,w*) = 0] -exp(-\(z,z',w,L))\ 

< B 1 (z, z', w, L) + B{ (z, z\ w, w*, L) + B 2 (z, z', e, e\ w, L), 



where 



B2(z,z',e,e',w,L) = X(z, e',w,L) + X(e,z',w, L) +X(e, e', w,L), 
where A := min(A, 1). 

PROOF: Immediate from (15. 4p . □ 

In Section [6l we take for z and z' convenient approximations to numbers of individ- 
uals alive in generations r > 1 in the bipartite branching process that starts from the 
two individuals A of type (ki, 1) and B of type (k 2 , 1). For r = 21 even, we take z to 
approximate the X A {1) descendants of A, and z! to approximate the X B {1) descendants 
of B, and W\ — ni, 1 < I < K. For r = 21 — 1 odd, we take z to approximate the Y A {1) 
descendants of A, and z! to approximate the Y B (l) descendants of B, now with wi = mi, 
1 < I < J, We then show that these approximations are sufficiently close to the corre- 
sponding numbers z and z' of class 1 and class 0' descendants of individuals A and B in 
generation r, so that P[S(z, z', w) = 0] is correspondingly close to exp{— A(5, z', w)}. 



6 Approximating inter-point distances 

We now return to the problem of real interest, the distribution of the graph distance 
D := Dk lt k 2 between two vertices (ki, 1) and (k 2 , 1) in the intersection graph, taken to be 
infinite if the vertices are in different components. 
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6.1 Conditioning on the branching process 

We begin by approximating the conditional probability P[D > d\Z] that A and B are 
more than distance d apart, given the trajectory of the bipartite process Z starting from A 
and B. The conditional probability is then a function only of the way in which the labels 
were assigned to the individuals in the process Z. The labelling determines the classes of 
the individuals, and the event {D > d} occurs exactly when there are no overlaps between 
the labels of the class 1 and class 0' individuals that are descended from A and those of 
the descendants of B, at any generation I, 1 < I < d, of Z. Let Q s denote the information 
in the labels up to generation s. 

Proposition 6.1 For any 1 < I < d, 

\P[D>l\Z,g l _ l n{D>l}]-e- x ' {l ' z) \ < <f>(l,Z), (6.1) 

where (ft is given in 116. 6\) and A6.11\) , and X'(l, Z) in Ii6. 8\) and A6.10\) . It then follows in 
particular that 

d d 

V[D>d\Z]-J\e- x '^ z) < ^2<f>{l,Z). (6.2) 
1=1 i=i 

PROOF: Suppose, first, that I is even. By Proposition 15.11 the probability 

P[D >l\Z, C?z_i n{D> I}] 

is close to exp{—X(z(l),z'(l),n,K)}, with n k = n k , z(l) the numbers of children of the 
different types of class 1 descendants of A in generation I — 1, and z'(l) is the same for 
descendants of B. When applying Proposition I5.1[ Ri represents the number of class 1 
descendants of A in generation I — 1, and z\ r the number of offspring of the r-th of these; 
these offspring make up the class 1 and class 0' descendants of A in generation I. The 
error in the approximation is then no larger than 

fli(s(0, n, K) + Bl(z(l), z'(l),fi, n\l — 1), K), 

a quantity that we shall need to bound later, where n* k (l — 1) is the number of labels for 
(k, 1) individuals already used up to generation I — 1 of the Z-process. 

The quantities z(l) and z'(l) appearing in A are not directly accessible, and are not 
functions of Z alone. However, we can exploit Proposition 15. 2\ provided that we can find 
suitable approximations to them. The first is to replace z(l) by X A (l/2), the numbers 
of descendants of A of the different types in generation 1/2 of the X-process, and z'(l) 
by X B (l/2), noting that 

< Xi{l/2)-z k {l) < Gf(//2); < (1/2) - z' k {l) < G£(Z/2), (6.3) 

and that the number of ghosts G k (l/2), investigated in Section HI is calculated for the 
whole bivariate process. Then the quantities X A {l/2) and X B (l/2) can in turn be more 
simply approximated, using Theorem 13. 6[ by T l ^ 2 W A jj and r l ^ 2 W B fi, where W A is the 
limit of the martingale t~ 1 v 1 X A (i), and W B the limit of t~ % v t X B {i): note that these two 
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random variables are independent, by the branching property. From Proposition 
replacing z(l) by T l ^ 2 W A jj, and z'{l) by r lj/2 W B [i, we incur a further error of at most 
B 2 (X A (l/2),X B (l/2),e A (l),e B (l),n,K), where 

e A (l) = G x (l/2) + E(l/2, X A ); e B (l) = G x (1/2) + E(l/2, X B ), (6.4) 

with E(l/2,X A ) = X A (l/2) - r l / 2 W A /2 and E{l/2,X A ) defined analogously. By Theo- 
rem [3JH 

E{E(i,X)} < c 9 ((z + l) 7 l ) 1/2 , (6.5) 

for X = X A and for X = X B . 

We also clearly have n*(l — 1) < T x ((l — 2)/2), componentwise, where 

s s 

T x (s) := ^X(r) = ^(X^(r)+X B (r)), 

r=0 r=0 

and, as observed above, z(l) < X A {l/2), z'(l) < X B {l/2). As a result, the approximation 
error at this step is no larger than 

4(1, Z) := B x (X A (l/2), X B (l/2), n, K) + B{(X A (l/2), X B (l/2), n, T x ((l - 2)/2), K) 
+ B 2 (X A (l/2),X B (l/2),e A (l),e B (l),n, K), (6.6) 

with e A {l),e B (l) as in (16.41) ; thus we have, for I even, 

\P[D>l\Z,g l _ 1 n{D>l}]-ex V {-\'(l,Z)}\ < 4(1, Z), (6.7) 

where, for I even, 

X'(l,Z) := \(r l/2 W A fi,r l/2 W B ^n,K). (6.8) 
A similar argument for I odd yields 

\P[D>l\Z,g l ^ l n{D>l}]-exp{-X'(l,Z)}\ < 4(1, Z), (6.9) 

where, for /odd, 

\'(l,Z) := X(Cr {l - 1)/2 W A jl,Cr {l - 1)/2 W B jl,m, J), (6.10) 
4(1, Z) := B l (Y A ((l + l)/2),Y B ((l + l)/2),m,J) 

+ + l)/2), Y B ((l + l)/2), m, T y ((/ - l)/2), J) 

+ 5 2 (F A ((/ + l)/2), Y B ((l + l)/2), r] A (l),r] B (l), m, J). (6.11) 

Here, 

rf(0 = Gl((l+l)/2)+E'((l+l)/2,Y A ); V B (l) = G Y k ((l + l)/2) + E'((l + l)/2,Y B ), 

(6.12) 

and 

E{E'(i,Y)} < c 14 n mn (2Y) 1/2 , 

for Y = Y A and for Y = Y B , by Theorem 13.71 This proves the first statement of the 
proposition. 
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The second part is easier. We first note that 

P[£> >l\Z] = E{E(/[D > I - 1]P[D >l\Z, Qx_ x n {D > I}} \ Z) \ Z}, 
and deduce from the first part that 

\P[D > l\Z]-e- x ' {l ' z) P[D > I -1\Z]\ < (f)(l,Z), 
from which the last part follows. □ 

Thus, combining 06.71) and (16. 9p with Proposition 16.11 we find that 

d 

\P[D> d\Z]-exp{-W A W B L(d)}\ < (6.13) 



i=i 



where, using (I3.3p . 



L(2i) := V4 - 2 E- + C 2 E- 



T 2i -1 , 

75 — T r ( r 
r z — 1 



k=l Uk j=l "" 3 

K 2 / \ K 2 



l(«+i) : = ^{(^-^E^ + ^-dc 2 !:^) 

r 1 I fc=i nfc j=i m i ) 



= -J—\(t 2 * +2 -t 2 + t 2i+3 -t)Y^ 

= f- L T )n-v i+i -i)f:4 ) 

and hence, with (12. 3p . 

L(d) = Kn-\r d ~l), (6.14) 

for d both even and odd. 

6.2 The unconditional distribution 

The unconditional probabilities for D are now given by taking expectations in conjunction 
with (16.131) . so that it just remains to evaluate the terms E fcl]fc2 (/>(/, Z). Note that, in the 
approximation, randomness comes in only through the independent random variables 
W and W B , the first with a distribution which depends only on the value of ki, and the 
second on &2- 

To assist in judging the impact of the various factors in our bounds, it is convenient 
to define 

logn 



logT 



(6.15) 
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so that t 1 ° < n < r l0+1 , and to set 

r" 1 < ip(n) := tTV < 1. 
Then, with k given in (I2.3P and under standard asymptotics, 



L(d)-T d - io K<p(n)\ < 



X 



n \t 



-> 0, 



(6.16) 



and k remains bounded away from and oo. 

Theorem 6.2 For d = io + u, with «6Z and |w| < io/2 ; we have 

\V kl ,kA D - io > «] - E fcllfc2 exp{-^W B K r>(n)}| 

< c 25 {(r 3 "/ 2 + l)(n 1 / 4 e(m,n) 2 A 1) + (r u + ^^^(m.njfij, 

/or a suitable quantity c 2 §, where 

9i := (i + l^T/r 2 )*/ 4 . (6.17) 

PROOF: The approximating expression is immediate, from ( I6.13P and ( I6.16p . incurring 
an error of at most 



x 



P 

n \t 

For the rest, we just need to investigate Efc 1> fe 2 0(i, Z) for 1 < / < i + u. 
To start with, for I = 2r, we have 

E klM B 1 (X A (r),X B (r),n 1 K) < An- 1 Kc r r p x < n' 1 ^ 

with c 22 := 4:Kc p x , from (O) and fl3~TTT) . Then 



(6.18) 



E fel)fc2J B*(X^(r), X"(r), n, T (r - 1), K) 

A" r-l 



fc=l s=l 

< c 23 n~ 2 T 3r , 



(6.19) 



where c 23 = 2fsr(cop x ) 2 (c 5 ||/i|| 00 + CqK6)/(t - 1), from ([33]) and Corollary [331 For 
E fclife2J B 2 (X A (r),X i3 (r),^(2r),£ B (2r),n,i ; s:), with e A and e B defined as in (JS3D, we need 
to be a little more careful, because of the product G x (r)(X A (r) +X B (r)). However, from 
Theorem 14. 1[ it follows by Markov's inequality that, for any $ = $(m, n, r), 



fei,fe 2 



max G x (r) > $ 

Kk<K 



< P 



fei,fe 2 



A' 



(6.20) 



and because B 2 can never exceed the value 3, it follows that 
E klM B 2 (X A (r),X B (r),e A (2r),e B (2r),fi,K) 

< E klM B 2 (X A (r),X B (r),e A (2r),i B (2r),n,K) + QP klM 



max G x (r) > $ 



Kk<K 



< E klM B 2 (X A (r),X B (r),e A (2r),e B (2r),n, K + Qc 2 ^~ l T 2r e{m, n)\ 



~A 



(6.21) 
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with 

gf(2r) = $ + E(r,X A ) and e£(2r) = $ + E(r,X B ), 
and with C24 := ^c* 5 . But now, from (16. 5p and (I3.1ip . it follows that 

E kuk2 B 2 (X A (r), X B (r), £ A (2r), e B {2r), n, K) (6.22) 

< 2Kp x n- l c T r {<$> + c 9 ((r + l) 7 r ) 1/2 } + 2n- 1 {$ 2 + c 2 (r + ^ ~ ■ 

k=i qk 

Choosing $ 2 (m,n, r) := T r ne{m 1 n) i 1 and then adding the contributions from (I6.18P - 
(I6.22p for 1 < r < [(^o + M )/2j gives, after some computation, a bound of the form 

4 K /4 (r 3u/4 + l)e(m, n) 2 + (r u + + (r 3u / 2 + l)^ 1 / 2 ) . (6.23) 

Bounds analogous to (I6.18P and (I6.19P hold also for I = 2r — 1, with Y, J, m replacing 
X, K, n throughout the argument and estimates, and with C22 and C23 replaced by c 22 i 

and c 2 3 yf i. where 

4 = 4c 3r- 1 Jp y and c 23 = 2JZ(c p y /r) 2 (c 5 3 2 r- 2 ||p|| oo + c' 6 J9)/(r - 1). 
The bound corresponding to (16.201) is 



max G^(r) > $ 
i<j<J 3 



< $- i y^r 2(r - 1) Jct 6 e(m,n) 4 , (6.24) 



r k\,k2 

and we also have 

E klM B 2 (Y A (r), Y B (r), fj A (2r - l),f) B (2r - 1), m, J) (6.25) 
< 2Jp ¥ m~ 1 T- 1 c T r 3 x f^{$ + c M u mn (n r ) 1/2 } + 2m~ 1 {$ 2 + (c 14 u mn ) 2 n r } Y\ \, 

with 

f) A (2r-l) = $ + E'(r,Y A ) and fj B (2r-l) = $ + E'(r,Y B ). 

Here, we take $ 2 := mr r e(ra,ti) 4 in (I6.20p and (16.251) . and then, adding the errors over 
1 < r < [iq + 1 + u\ , and after much calculation, a bound of the form 



c'25 |n 1/4 e(m,n) 2 (r du/4 + 1) + n 1/4 e(m, n)(r u + 1)^ + (r^ /2 + l)^e(m,n) 2 \ (6.26) 
is obtained. 

To deduce the bound given in the theorem, it now suffices to observe that 



(n^e(m,n) 2 ) 2 > n^ 2 + Me{ 
v V m 



so that the final terms in f !6.23|) and (I6.26P can be absorbed into the first term, if the 
larger of the r-exponents is used. □ 
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The defective real valued random variable U, whose distribution function 

Pk lM [U<u] = l-E klM exp{-W A W B KrMn)} (6.27) 

approximates that of D — i for integer arguments, can be expressed as a (defective) 
translation mixture of scaled negative standard Gumbel random variables. If W A W B has 
distribution function F ku k 2 on an d if 

P kl ,k 2 [U' <u] := [ P[-(logr)- 1 (r + logs + log k) < u] dF klM (x) , (6.28) 
J (0,00) 

where T denotes a standard Gumbel random variable, then 

Pk uk2 [U<u] = P fcl)fe [C/ / <u + log^(n)/logr]. 
Alternatively, we can write 

P[£/ = oo] = P[U' = oo] = F klM (0) = l-P kl [W>0]P k2 [W>0], (6.29) 
and express the distribution C{U' \ U' < 00) as that of a random variable U, realized as 

U = {r + logWA + log^B + logK}, (6.30) 

logr 

where T, Wa and Wb are independent, 

P[W A <w] = P kl [W < w\ W > 0] and P[W B <w] = P k2 [W < w | W > 0]. 

Note that -^1^2(0) = P[U' = 00] indeed approximates the probability that (kx,l) and 
(/?2, 1) are in different components of the graph, and are hence at infinite distance from 
one another, as can be seen in the following result. 

Theorem 6.3 There are constants T\ > 1 and C26 < 00 such that 

max{P fel)fe2 [D < 00 I W A = 0], P klM [D < 00 | W B = 0]} < c 26 r~ l °. 

PROOF: We make the calculation for A; for B the argument is the same. From the general 
theory of multi-type branching processes, see for example |7] or [12J, conditional on the 
event {W A = 0}, X A is a subcritical branching process, and there exist T\ > 1 and C < 00 
such that 

E k [\\X A (t)\\ 1 \W A = 0] < Ct{ 1 for all k, 

and thus 

B k [\\Y A (i)\\ 1 \W A = 0] < CVf l+1 c C for all k. 

Hence, immediately, 

P klM [2t <D<oo\W A = 0] < Cr[ io . 
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Then, for 1 < % < i , from (13. lip . 



K 



k=l 

P klM [D = 2i-l\W A = 0] < m- l JCC 2 r{ i+l c T l - l p Y < n~ l JC3 2 T^ i+1 c T i - 1 p Y , 

and the theorem follows by adding over 1 < i < i . □ 

In view of the considerations above, our approximation can be summarized as follows. 

Corollary 6.4 For d = i + u, with itfZ and \u\ < io/2, we have 

\P klM [D <u + i ]-P klM [U' <u + \og<p(n)/\ogr}\ < (5(r u ,m,n); 
\ p ki,k 2 [D = oo] - P klM [U' = oo]| 

< 5(n a ,m,n)+n- 1 + P klM [0 < W A W B < r^K^rC 01 logn] + 2c 26 rf i() , 

for any < a < (io — 2)/2i « 1/2, where 

5(y,m,n) := c 25 {(t/ 3/2 + l)(n 1/4 e(m,n) 2 A 1) + (|/ + l)n 1/4 e(m,n)4}, 

io is as in $6.15\) . Qi is as in \6.17\), T\ is as for Theorem \6.3\ and U' has distribution given 



either by ( fQ5|) or by HTM) and ( fOSj) . 

PROOF: The first inequality is from Theorem 16.21 For the second, we have 
P kl ,k 2 [D < oo] 

< p klM [D < oo | w A = o] + p fcl)ta [z> < oo | w B = o] + p fclifc2 [iy A iy B > 0], 

giving 

P klM [D = oo] > l-V klM [W A W B >0]-2c 26 rr JO = P fclite [£/' = oo] - 2c 26 rf io . 

On the other hand, taking u = \oliq\ m the first part, we have 

P kl , k2 [D = oo] < 1 - P kl , k2 [U < u] + 5(n a ,m,n), 

and, from (|6.27|) . for any C > 0, 

Pk u k 2 [U < u] > 1 -P klM [W A W B < Cn~ a \ogn] - exp{-Cn- a \ognK(n a /r) V (n)} 
> P klM [W A W B > 0] - P felife2 [0 < W A W B < Cn- a \ogn] 

— exp{— C lognn/r 2 }. 

Hence, taking C = t 2 /k, 
P klM [D = oo] 

< P klM [U' = oo] + P fcl , fe [0 < W A W B < r 2 K -1 n -a logn] + n' 1 + o"(n a ,m,n), 

26 



and the corollary is proved. 



□ 



Remark. The corresponding result for the unipartite Erdos-Renyi graph may also be of 
interest, although, as discussed at the end of Section [Q it is not directly useful for our 
purposes. For such a graph, the vertices are divided into K types, with rik of type k, 
1 < k < K, and with n := J2k=i Edges are then independently assigned, with 
probabilities pk,k' depending on the vertex types k and k'\ the matrix P is thus symmet- 
ric. The mean matrix M for the associated branching process is given by PN, where 
iV := diagjni, . . . , n^}, and we assume that it is irreducible and aperiodic, and that its 
largest eigenvalue f > 1. With these assumptions, and writing fi T for the left eigenvector 
of M with eigenvalue f , only small changes need to be made to the sketched argument 
concluding Section [2j Considering coincidences in the indices in order of increasing gen- 
eration number, and with the offspring of A considered before those of B, links in the 
Erdos-Rehyi graph arise exactly when there are coincidences between indices of the X A (i) 
and those of the X B (i — 1), or between indices of the X B (i) and those of the Xj?(i). This 
leads to an approximate mean number of coincidences, up to and including the time when 
the first i generations of descendants of A and the first i — 1 of B have been considered, 
of kn~ 1 f 2l ~ l W A W B , where 

k 2 



. . Ok 



and qk '■= n^/n. For the time until the first i generations of both have been considered, 
the corresponding approximation is kn~ 1 f 2l W A W B . This gives the probability that the 
distance between A and B exceeds d as being approximately 

E klM {e^ dwAwB }, (6.31) 

very much the same as the formula in Theorem 16.21 Note once again that the assumption 
of irreducibility prevents this line of argument being directly applicable to the bipartite 
model. 



6.3 Asymptotic behaviour 

Recalling the standard asymptotics of Section [3j we now distinguish the possibilities for 
the bipartite branching process starting with a single vertex of type (k, 1), according to 
the behaviour of the ratio m/n, as n — > oo. This, in turn, enables one to deduce the 
asymptotic form of the approximating random variable U'. 

First, note that the Yj m ' n \l) ~ Bi (mqj , (mn) -1 / 2 !!/^), 1 < j < J, are indepen- 
dent. If m/n — y r with < r < oo, then the Poisson approximation to the bi- 
nomial distribution thus shows that the distribution of Yj m ' n \l) differs in total varia- 
tion from Po (^UkjqJ (m, n)) by at most (mn) -1 / 2 !!^ ~ n" 1 !!^ / \/r , and, conditional 
on y( m ' n )(l) ) the distributions of the Xi m ' n \l) are independent, and close to the same 

order to Po (X)/=i Yj^^i^^ijQ? ( m } n)r~ l l 2 ). Hence, as m and n tend to infinity in this 
way, the bipartite branching process converges to the one with exactly Poisson offspring 
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distributions and with Qx' n and Q Y replaced by Qx and Qy. Hence the distribution 
£(iy{m,n) | x(0) = e^) converges to £(W \ X(0) = e^), for each k, where W is the lim- 
iting random variable associated with the limiting Poisson-based branching process. It 
thus follows that the distribution of the random variable U ,(jn,n ^ also converges to that of 
the corresponding U' . However, the distribution of U (jn,n ' > does not converge in general, 
because the value of log<p(n)/ logr oscillates between —1 and as n varies. 

If m/n — > oo, the Poisson approximation Po (^/m/nllkjqj(m, n)) to the distribution 

of Yj m ' n \l) still has error of at most (ran) ' H k j. However, a simple calculation shows 
that, for Bj" 1 '^ a Bernoulli random vector with p[_gj m,n ) = e (0] — qf(m } n)Hij^n/m, 

d TV (C(X^ n \l)\Y^ n \l) = ^),C(B^ n) )) < |^gf(m,n)n fciV ^M 

I k=i 

It now follows from the Poisson thinning theorem (see for example Chapter 8 Section 6 
in [9]), and because 

j 

^U kj qJ(m,n)I[ lj qi c (Tn,n) = M^ n \k,l), 

3=1 

that 

d TV (c(X^ (1)| (0) = e {k) ) , ®f =1 Po (M { ™' n) (&,/))) 




an error of order 0(sjn/m). Thus, in this regime, the offspring distribution for the X^ m ' n ^ 
process, which determines the distribution of H 7 *™'™), approaches one with independent 
Poisson components, having means given by the matrix Mj. Again, this entails the 
convergence of U ,<jn,n ^ to U', but not the convergence of [/( m ' n ). 

Finally, if m/n is small, the simple bound (1 — p) 1 > 1 — Ip shows that 

P[Yj m ' n \l) 0] < {m/n) l ' 2 n kj q^{m,n), 

from which it follows that P[W > 0] < (m/n) 1 / 2 Y2j=i Qj{ m i n )^kj- Hence, for m/n — > 0, 
the distance between two randomly chosen vertices (ki, 1) and (&2, 1) is infinite, with 
probability close to 1. However, if the two vertices A and B do each have an edge 
joining them to the object set, then each is connected to just one object with conditional 
probability of order 1 — O(y^), and the objects to which they are linked are distinct 
with probability of order 1 — 0(l/m). The distance between these two objects can now 
be investigated, in this regime, by swapping the roles of vertices and objects, and using 
the theorems above. 

Thus if, in this scheme, m/n converges to a finite or infinite limit, the approximat- 
ing probability distributions C(U' ( - m ' n ^) remain relatively stable. In the error terms, the 
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quantities r( m,n ) and ■y( m < nS > converge to limits r and 7, the corresponding quantities for 
the limit matrix Mx- The factor n 1//4 e(m, n) 2 behaves like n _1//4 , and 9i like n~ 5 logn, 
for some 5 depending on Mx, as long as m/n is bounded below as n — > 00; in view of 
(13. 6p and (I6.17p . it follows that 5 < 1/4. The discussion above shows that m/n bounded 
below is the case of main interest. 

7 An exponential random graph model 

Rank 1 matrices P = a(3 T give rise to an exponential random graph model. The individual 
edges are independent, as before, and the probability of a vertex of type i connecting to 
an object of type j is of product form. These models have been extensively studied in 
the social science literature, see for example [20] and references therein; for applications 
to affiliation networks as bipartite networks, see for example [22J. 
In this case, 

j j 
M x (k,l) = '^a k p j m j P j aini = akaini^mj/3 2 , 
3=1 j=i 

so that 

M x = Caa T N X} 

with C = J2j=i m jPji has r = Ca T N X (y, \i = N X (y/l T Nx(x and v = C{t~ 1 1 t N x ct)ct- 
Here, 1 is a K x 1-vector of l's. 

As shown in the preceding sections, the principal eigenvalue r of Mx in our general 
multitype intersection graph is of critical importance in determining network distances. 
It turns out that its value can be bounded below by that obtained from an associated 
rank 1 matrix, adding to the importance of the exponential models. To see this, set 
D£ := J2j=iPkj m j to be the average degree of a type (k,l) vertex, and write s 2 B : = 
Et^D*) 2 . 

Proposition 7.1 If the values , 1 < k < K, are fixed, then 

t > s 2 B /m, 

and this value of t is attained by taking p k j = /m for all j ; with this choice of the 
Pkj 's, a vertex makes no distinction as to which types of object it has links to. The lower 
bound is minimized, ifJ2k=i^ > 'k = ^+ ^ s fi xe d, by taking = D*/n, so that all links 
have the same probability p^j = D+/(mn). 

PROOF: The proof is taken from [2], p. 15. The matrix Mx '■= PNyP T Nx has the same 
eigenvalues as the symmetric matrix 

V := N l J 2 PN Y P T N l J 2 . 
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Write Ukj '■— {D*) 1 mp k j — 1, and note that "Y^j=\ m j u kj — for each k. Then 

j 

V k i = y/rik ^ PkjmjPij y/n~, 
3=1 

= m~ l D* y/n^D* \fn~ < 1 + m~ x ^ in,/!!,-,/!,, 



From the Rayleigh-Ritz Theorem [TT], Theorem 4.2.2, it follows that r, the largest eigen- 
value of V, is at least as large as e T Ve/(e T e), for any e e R . Taking e k = y/rtk/ sb, 
1 < k < K, gives 

r > m _1 s^ + ||f H2, 



where 

K 

-'£- ,r,Vl2 

fc=i 



:= m 1 >^ n k (D%) 2 u kjy /rn], 1 < j < J. 



Since, with p^j = Dff/m for all /c, E takes the form m~ 1 ww T , with u> = N][ 2 D x , and 
hence has largest eigenvalue m _1 ||w||2 = s B /m, this proves the first statement of the 
proposition; the second is now immediate. □ 

Thus the value of r for a given P is always bigger than that corresponding to as homo- 
geneous a choice of the link probabilities as is allowed by the constraints on the average 
number of objects linked to a given vertex. 

Any rank one choice P = af3 T has a minimality property, analogous to that of Propo- 
sition 17.11 but of a less intuitive nature. The matrix P = a(3 T minimizes the maximum 
eigenvalue of Mx among all choices of P satisfying the constraint 

PN Y (3 = (f3 T N Y /3)a. 
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